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EFFECTS  OF  AVERAGING  DATA  DURING 
STIMULUS  GENERALIZATION 

Bernard  Migler 

WALTER  REED  ARMY  INSTITUTE  OF  RESEARCH1 


Rats  were  trained  to  press  two  keys  consecutively  for  reinforcement.  During  stimulus  one 
(slow  clicker)  a  6-sec  time  delay  was  required  between  the  two  responses.  During  stimulus  eight 
(fast  clicker)  no  time  delay  was  required  between  the  two  responses.  When  tested  with  inter¬ 
mediate  stimuli  (intermediate  click  rates)  the  median  time  delays  emitted  by  the  animals  were 
intermediate  between  their  performances  on  ^he  original  training  stimuli,  resulting  in  typical 
generalization  gradients.  Closer  examination  of  the  data  revealed  that  the  median  values  were 
not  representative  of  the  behavior  of  the  animals. 


Stimulus  generalization  gradients  are  con¬ 
ventionally  considered  to  have  the  form  of  a 
sloping  gradient  of  response  rates  away  from 
the  reinforced  stimulus  (Guttman  and  Kalish, 
1956).  Some  exceptions  have  been  reported. 
Hearst  (1962)  obtained  Hat  generalization 
gradients  using  an  avoidance  procedure  prior 
to  discrimination  training  but  obtained  a  con¬ 
ventional  sloping  gradient  of  response  rates 
after  discrimination  training,  as  did  Sidman 
(1961).  A  second  exception  concerns  the  peak 
shift  phenomenon  following  discrimination 
training  (Hanson,  1959),  where  the  sloping 
gradient  is  obtained  but  from  a  point  dis¬ 
placed  from  the  originally  reinforced  stimulus 
in  a  direction  away  from  the  unreinforced 
training  stimulus.  In  general,  a  sloping  gradi¬ 
ent  of  response  rates  (and  presumably  response 
strength)  is  obtained  when  the  trained  subject 
is  tested  on  stimuli  removed  from  the  training 
stimuli.  This  general  result  has  survived  a 
variety  of  testing  procedures.  Herrnsiein  and 
van  Sommers  (1962)  for  example,  used  five 
training  stimuli  and  four  test  stimuli  inter¬ 
mediate  between  the  five  training  stimuli. 
They  found  that  the  response  rates  in  the  in¬ 
termediate  test  stimuli  were  intermediate  in 
relation  to  the  rates  in  the  adjacent  training 
stimuli.  Another  variation  in  which  two  in¬ 
compatible  responses  were  available  to  the 
subject  has  been  used.  Honig  and  Day  (1962) 
trained  animals  to  respond  to  one  key  (iden¬ 


tity  key)  when  there  wras  no  difference  in  wave¬ 
length  between  two  illuminated  keys  and  to 
peck  at  a  second  key  (difference  key)  when 
there  was  a  difference  of  40  millimicrons  (m^) 
in  wavelengths.  When  tested  at  wavelength 
differences  from  0  to  70  lii/z  there  was  a  grad¬ 
ually  decreasing  amount  of  responding  to  the 
identity  key  as  the  difference  increased,  and  an 
increasing  amount  of  responding  to  the  differ¬ 
ence  key.  La  Berge  (1961)  found  a  parallel  re¬ 
sult  with  human  subjects  using  a  two-response 
situation. 

Some  investigators,  however,  have  raised 
questions  about  accepting  the  general  shape  of 
these  generalization  gradients.  Blough  (1963) 
showed  that  generalization  test  stimuli  princi¬ 
pally  affected  the  long  interresponse  times, 
rather  than  the  short  ones  after  variable  inter¬ 
val  training.  Cross  and  Lane  (1963)  using  two 
topographically  discontinuous  (or  continuous) 
responses  found  sharply  sloping  gradients  of 
generalization  i elating  response  probability  (of 
one  response  or  the  other)  to  stimulus  inten¬ 
sity.  Under  the  particular  conditions  of  the 
present  experiment,  gradually  sloping  general¬ 
ization  gradients,  presumably  implying  grad¬ 
ually  changing  strengths  of  behavior,  were 
obtained.  But  they  were  found  to  be  the  result 
of  averaging  procedures,  and  actually  unrepre¬ 
sentative  of  the  behavior  of  the  animals. 

METHOD 


Subjects 

'Reprints  ma>  be  obtained  from  the  author.  Institute  „  , 

for  Behavioral  Research,  2426  Linden  Lane,  Silver  I  Wo  experimentally  naive  adult  male  albino 

Spring.  Maryland.  rats  served. 
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Apparatus 

The  Ss  were  housed  and  tested  in  a  special 
purpose  device  called  an  elevator,  described 
earlier  (Migler  and  Brady,  1964).  In  this  de¬ 
vice  the  living  cage  becomes  the  testing  cage 
when  the  testing  panel  is  positioned  in  front  of 
the  living  cage.  The  testing  panel  contained 
two  response  keys  that  could  be  illuminated 
from  behind,  pellet  feeder,  and  speaker.  Under 
program  control  this  panel  moved  up  from  the 
home  position  at  the  bottom  of  the  elevator, 
not  facing  any  cage,  to  a  cage  in  which  a  rat 
lived  for  a  test  session.  Then  it  moved  up  to 
the  second  rat  for  another  session  and  then 
home  to  the  bottom  of  the  elevator. 

Conventional  relay  circuitry  was  used  to 
program  the  contingencies,  and  the  data  were 
recorded  on  punched  paper  tape  for  later  sort¬ 
ing  and  computation  by  a  digital  computer. 

Procedures 

Water  was  available  at  all  times  from  a 
water  bottle  mounted  on  the  cage.  All  nourish¬ 
ment  was  obtained  from  the  97  mg  food  pellets 
earned  as  reinforcements  during  the  daily  test 
ing  sessions.  The  number  of  reinforcements 
per  session  ranged  from  about  100  to  400.  The 
use  of  the  elevator  and  the  punched  tape  re 
cording  system  permitted  seven  day  per  week 
training.  Daily  sessions  were  10  hr  in  duration 
for  each  rat  with  14  hr  of  deprivation  between 
sessions  for  each  rat. 

Preliminary  Training .  The  rats  were  first 
food  deprived  for  24  hr,  then  over  successive 
days,  were  maga/ined  trained  and  trained 
under  CRF  conditions  to  press  the  illuminated 
response  key  on  the  left  side  of  the  cage  (key 
B).  Subsequently,  over  several  sessions  key  A 
on  the  right  hand  side  of  the  test  panel  was 
illuminated  and  key  B  was  darkened.  Pressing 
key  B  when  darkened  had  no  effect,  pressing 
key  A  when  illuminated  extinguished  the  light 
on  key  A  and  illuminated  key  B.  A  press  on 
key  B  at  this  time  was  reinforced  with  a  pellet 
delivered  into  a  trough  between  the  keys  near 
the  floor,  and  the  illumination  was  switched 
from  key  B  back  to  key  A.  After  the  response 
A  to  response  B  sequence  was  established,  il 
lumination  of  key  A  after  a  key  B  response 
was  discontinued,  so  that  the  cage  was  dark 
until  the  rat  pressed  key  A  which  illuminated 
key  B  and  started  the  run.  When  key  B  was 
pressed  the  cage  was  darkened  and  reinforce¬ 


ment  delivered.  The  purpose  of  this  change 
was  to  provide  for  the  later  use  of  a  limited 
hold  procedure  in  which  an  excessive  delay 
after  response  A  would  be  indicated  immedi¬ 
ately  by  darkening  of  the  cage. 

Preliminary  training  continued  with  the  in¬ 
troduction  of  a  1-sec  minimum  delay  require¬ 
ment  between  response  A  and  response  B. 
Only  if  the  rat  pressed  key  B  at  least  1-sec  after 
it  had  pressed  key  A,  was  the  food  pellet  deliv¬ 
ered.  A  to  B  response  times  shorter  than  1  sec 
were  not  reinforced.  As  training  progressed 
this  minimum  was  gradually  increased  to  6  sec. 
The  limited  hold  was  added  at  this  point. 
Previously,  the  B  light  remained  on  indefi¬ 
nitely  after  the  A  response.  Now  a  limit  was 
imposed,  so  that  15  sec  after  the  A  response 
had  occurred  the  cage  was  darkened,  and  the 
rat  had  to  press  key  A  again  to  set  up  a  rein¬ 
forcement  on  key  B. 

The  next  major  step  was  to  establish  an 
auditory  discrimination.  The  lighting  arrange¬ 
ment  on  the  keys  was  maintained  as  before 
and,  in  addition,  when  the  trial  (or  run)  was 
started  by  a  press  on  key  A  a  clicker  was 
sounded  until  response  B  was  made,  or  until 
the  limited  hold  reset  the  trial  after  the  15-sec 
limit.  The  click  frequency  w^as  either  2.5  per 
sec  (stimulus  one)  or  45.8  per  sec  (stimulus 
eight)  distributed  randomly  at  50%  each. 
When  stimulus  eight  was  on,  reinforcement  on 
key  B  was  immediately  available,  and  when 
the  A  response  turned  on  stimulus  one  the 
usual  6-sec  minimum  delay  was  in  effect.  Forty- 
five  discrimination  training  sessions  were  nec¬ 
essary  (including  some  sessions  in  which  only 
stimulus  ot'.e  or  stimulus  eight  was  in  effect  for 
the  entire  session)  to  produce  a  clear  separa¬ 
tion  of  the  behaviors  in  the  two  stimuli. 

In  the  next  and  last  step  in  preliminary 
training,  reinforcement  was  delivered  only 
after  50%,  of  the  appropriate  A  to  B  responses 
(10  sessions),  and  then  after  only  about  33%, 
(20  sessions).  That  is,  during  stimulus  eight 
only  about  one-third  of  the  sequences  were 
reinforced.  During  stimulus  one  only  one- 
third  of  those  sequences  over  the  6-sec  mini¬ 
mum  were  reinforced;  the  limited  hole!  was  in 
effect  in  both  cases.  This  intermittent  rein¬ 
forcement  was  introduced  to  prevent  rapid  ex¬ 
tinction  during  generalization  icsting,  which 
was  carried  out  in  extinction. 

Generalization  Testing.  On  the  generaliza¬ 
tion  testing  days,  the  procedure  was  modified 
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as  follows:  every  tenth  trial  constituted  a  test; 
when  the  rat  pressed  key  A,  key  B  was  illumi¬ 
nated  as  usual  and  a  clicker  sounded.  Hie 
click  rate  of  this  test  stimulus,  however,  was 
different  from  the  dick  rate  in  the  training 
stimuli.  The  other  nine  trials  were  regular 
training  trials  with  either  the  2.5  per  sec 
clicker  (stimulus  one)  or  the  45.8  per  sec 
clicker  (stimulus  eight)  each  occurring  50%  of 
the  time.  A  to  B  responding  during  the  train¬ 
ing  stimuli  was  reinforced  as  usual  on  VR  3. 
The  temporal  requirements  were  maintained 
in  stimuli  one  and  eight.  Six  different  test  click 
rates  (see  Fig.  1)  were  investigated  in  an  irreg¬ 
ular  order,  but  only  one  per  session,  and  each 
was  examined  twice  for  a  total  of  12  test  ses¬ 
sions. 


RESULTS  AND  DISCUSSION 

Figure  2  presents  the  performance  of  each 
rat  on  the  last  day  of  preliminary  training. 
The  time  in  seconds  from  response  A  to  re¬ 
sponse  B  is  presented  on  the  abcissa,  and  the 
relative  frequency  (or  percent  of  total)  of  each 


STIMULUS 

Fig.  1.  Stimuli  used  during  experiment.  Training 
stimuli  were  stimulus  1  (2.5/sec)  with  a  6-sec  minimum 
delay  required  between  responses,  and  stimulus  8 
(15.8/sec)  with  no  delay  tequired.  Stimuli  2-7  were  test 
stimuli. 
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Fig.  2.  Performance  of  both  rats  on  the  training  stim¬ 
uli  1  (6-sec  delay)  and  8  (no  delay).  Performance  in 
stimulus  1  is  displaced  upward  for  clarity. 


A  to  B  time  is  on  the  ordinate.  The  perform¬ 
ance  for  the  entire  session  during  stimulus 
eight  when  no  delay  was  required  is  shown  on 
the  bottom  of  each  rat's  set  of  curves  with  the 
peak  shifted  left  toward  the  fast  A  to  B  times. 
The  performance  during  stimulus  one  is 
merely  displaced  upward  for  clearer  separa¬ 
tion  and  for  direct  comparison  of  the  two  be¬ 
haviors.  The  figure  indicates  that  the  discrimi¬ 
nation  was  well  established  in  both  rats  and 
that  the  behavior  in  each  stimulus  was  appro¬ 
priate  to  the  temporal  contingencies  in  that 
stimulus. 

Figure  3  shows  some  of  the  results  of  the 
performance  of  the  rats  during  the  test  stimuli. 
The  median  A  to  B  time  is  given  on  the  ordi¬ 
nate  in  this  case,  and  the  two  curves  for  each 
rat  indicate  the  two  tests  per  stimulus.  The 
figure  presents  a  generalization  gradient  in 
which  the  median  A  to  B  time  is  roughly  a 
continuous  function  of  the  click  rate.  During 
intermediate  stimuli  the  median  A  to  B  times 
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Fig.  3.  Median  A  to  B  time  in  the  six  test  stimuli  for 
both  rats. 


are  intermediate  between  the  performance  in 
the  training  stimuli. 

However,  this  method  of  treating  the  data 
presents  a  distorted  description  of  the  behav¬ 
ior  of  the  animals.  Figures  4  and  5  present  the 
data  in  another  way.  Rather  than  averaging 
the  data  in  each  test  stimulus  by  presenting 
the  medians  (or  means,  which  presented  the 
same  picture),  Fig.  4  and  5  present  the  relative 


Fig.  4.  Rat  CG-1.  Relative  frequency  distributions  for 
performance  during  all  12  test  sessions  (two  tests  per 
test  stimulus).  The  left-hand  column  of  curves  shows 
performance  on  the  original  training  stimuli  for  the  12 
sessions  wilh  the  performance  in  stimulus  1  displaced 
upward.  The  right  hand  column  of  figures  shows  the 
pel  forma nre  (and  its  replication)  in  each  of  the  six  test 
stimuli. 


frequency  distribution  of  A  to  B  times  in  each 
test  stimulus. 

The  abcissa  represents  the  A  to  B  times  in 
seconds  and  the  relative  frequency  of  each  A 
to  B  time  is  on  the  ordinate.  The  same  magni¬ 
tudes  of  the  coordinates  were  used  in  all  the 
curves  in  the  figure.  The  lefthand  column  of 
curves  shows  the  performance  on  the  regular 
training  stimuli,  one  and  eight,  during  the  12 
test  sessions.  The  behavior  in  these  stimuli 
were  not  affected  in  any  systematic  way  by  the 
testing  procedure.  The  main  effect  of  the  ex¬ 
periment  can  be  seen  in  the  right-hand  col¬ 
umn  of  curves.  Progressing  from  test  stimulus 
two  through  test  stimulus  seven,  the  change  in 
performance  that  takes  place  is  the  disappear¬ 
ance  of  behavior  appropriate  to  stimulus  one 
(slow  A  to  B  times)  and  the  emergence  of 
behavior  appropriate  to  stimulus  eight  (fast  A 
to  B  times),  without  the  emergence  of  inter¬ 
mediate  behaviors  in  the  intermediate  stimuli. 

(dearly,  the  intermediate  test  stimuli  do  not 
generate  intermediate  behaviors,  but  only 
mixtures  of  behaviors,  composed  of  fast  A  to 
B  times  and  slow  A  to  B  times.  The  effect  of 
the  test  stimuli  was  the  control  over  the  rela¬ 
tive  proportions  of  slow  or  fast  A  to  B  times 
in  the  mixture. 

Therefore,  in  this  case  at  least,  the  sloping 
generalization  gradients  seen  in  Fig.  3  were 
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Fig.  R.  Rat  CXi-2.  Sec  caption  for  Fig.  4. 
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behaviors UC  l°  e**m  °*  ilveraRinK  different 

whPilll  a^>rW|W'ate  to  *,ujuire  at  this  point 
thf  l,resent  results  are  related  to  other 
generalization  testing  procedures,  e.g.,  where 
-annnal^  trained  on  a  variable  interval 
hedule  of  reinforcement  in  one  stimulus  and 
inguished  in  a  second  stimulus  and  re¬ 
sponse  rates  during  generalization  testing  are 
mnd  to  be  a  declining  function  of  distance 

Ik  Jr  P°slt,ve  training  Stimulus  (or  from 
e  shifted  peak).  A  procedure  which  might 
‘  luminate  the  relevance  of  the  present  results 

.V'Vr?  W°U,<1  retluire  animals  to  respond 
the  left  key  (in  a  two-key  chamber)  on  a  Vl 

°1,  say>  15  sec>  in  the  presence  of  a  low  fre¬ 
quency  click  rate  and  on  the  right  key  on  a  VI 
of,  say,  60  sec  in  the  presence  of  a  high  fre- 
quency  click  rate.  During  intermediate  test 
click  rates,  the  results  of  the  present  experi¬ 
ment  would  predict  that  the  animals  would 
respond  part  of  the  time  to  the  left  key  and 
part  ol  the  time  to  the  right  key.  The  average 
response  rate  for  the  two  keys  combined  dur¬ 
ing  a  test  stimulus  would  probably  be  inter¬ 
mediate  between  the  rates  on  occurring  during 
t  te  training  stimuli.  However,  the  rates  on  the 
individual  keys  would  not  be  intermediate  VI 
rates  but  would  be  the  rates  characteristic  of 
the  15-sec  VI  when  the  animal  was  working  on 
the  left  key  and  the  rate  characteristic  of  the 
60-sec  VI  when  working  on  the  right  key.  The 
overall  intermediate  rate  would  therefore  be 
an  artifact  of  averaging  these  different  behav¬ 


H07 

iors.  Should  such  an  outcome  be  approxi¬ 
mately  correct  it  would  suggest  that  the  sloping 
generalization  gradients  of  response  rates  ob¬ 
tained  with  the  single  manipulandum  proced¬ 
ure  may  be,  in  part,  the  result  of  inappropri¬ 
ate  averaging  procedures. 
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